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DMA SEQUENCE ENC ODING 
SQUALENE SYNTHETASE 

Background of the Invention 

The present invention relates to genetic 
engineering, particularly relating to nucleic acid 
sequences encoding enzymes useful in isoprenoid 
biosynthesis; and specifically to DNA sequences 
5 encoding enzymes involved in catalyzing the 
biosynthesis of squalene. 

Squalene is a 30 -carbon triterpene composed 
of six isoprene units. Squalene is an intermediate in 
the biosynthesis of sterols, such as, cholesterol. 

10 See, Bradfute et al . - J- Biol, Chem. , Vol- 267, p, 

18308 (1992) and Ericsson et al . , J. Biol, Chem,. Vol. 
267, p. 18708 (1992) • Squalene synthetase, an enzyme 
that resides at a branch in sterol biosynthesis, 
catalyzes the formation of squalene via reductive 

15 dimerization of the 15 -carbon intermediate farnesyl 
diphosphate (FPP) . This reaction is the first 
committed step in sterol biosynthesis. Alternatively, 
FPP may serve as substrate for numerous other 
prenyltransf erases leading to the prenylation of 

20 proteins, or to the formation of non- sterol isoprenoids 
such as dolichols, ubiquinones, and carotenoids. See, 
Kuswik-Rabiega et al . . CJ , Pj-^l > Chen^. , Vol. 262, p. 
1505 (1987), Summers et al . . gepe. Vol. 136, p. 185 
(1993); Keller et al . . Arch. Biochem, Biophys . , Vol. 

25 302, p. 304 (1993); LoGrasso et al . . Arcj->. Biog)i^m. 
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Biophys^ . Vol. 307, p. 193 (1993); McKenzie et al . . J, 
BiQ. Chgm,, Vol. 267, p. 21368 (1992); European Patent 
Application No. 486,290 and G.B. Patent Application 
Nos. 2,249,099 and 2,272,442. 
5 Squalene synthetase has been studied in 

mammalian systems, and is considered a key enzyme in 
the regulation of cholesterol biosynthesis. Thus, 
squalene synthetase has been studied as a possible 
target for the design of therapeutics aimed at 
10 controlling serum cholesterol levels. See, Robinson et 
al^. Mol. Cel l. Biol. . Vol.13, p. 2706 (1993). 
Inhibitors of the enzyme 3-hydroxy-3-methylglutaryl-CoA 
reductase (HMG-CoA reductase) have been shown to reduce 
serum cholesterol levels. See, Goldstein et al . . 
15 Nature, Vol. 343, p. 425 (1990) and Brown, et al . , 
Biol, Chem., Vol. 253, p. 1121 (1978). However, 
because HMG-CoA reductase mediates an early step in the 
isoprenoid pathway, the potential exists for the 
depletion of other key isoprenoids derived from 
20 mevalonic acid. Selective inhibition of squalene 
synthetase on the other hand, should not directly 
suppress biosynthesis of these non-- sterol isoprene 
metabolites. Consequently, inhibitors of squalene 
synthetase may have advantages over inhibitors which 
25 act further upstream in the pathway. A thorough 

knowledge of the structure and activity of this enzyme 
would greatly facilitate the design of specific 
modulators of squalene synthetase. 

Squalene has been found to have useful 
30 applications as coatings, as set forth in U.S. Patent 
Nos. 5,198,254 to Nisperos-Carriedo et al . and 
5,284,508 to Shibata et al . . in the cosmetics industry, 
as is set forth in U.S. Patent Nos. 4,699,930 to Suga 
and 5,116,607 to Jones; and in the pharmaceutical 
35 industry, as is set forth in U.S. Patent No. 4,806,352 
to Cantrell and 5,260,067 to Zheng Xu. Squalene 
synthetase inhibitors are set forth in Dufresne et al . . 
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Tetrahedron . Vol. 48, p. 10221 (1992); Hensens et 
Tetrahedron Let > . Vol. 34, p. 399 (1993) and U.S. 
Patent Nos, 5,250,424 to Bills et al . . 5,252,471 to 
Byrne et al . and 5,278,067 to Dawson et al , . 
5 It would be highly desirable to have the 

capability to alter those biosynthetic pathways which 
involve squalene, particularly in higher plants such as 
Solanaceae. As such, it would be desirable to provide 
nucleotide sequences that encode enzymes (e.g., 

10 sgualene synthetase) , which are useful in the 

biosynthetic pathway of squalene, and various other 
isoprenoids including sterols. In particular, it would 
be desirable to provide the nucleotide sequences that 
encode squalene synthetase from a higher plant species, 

15 such as a Nicotiana species. 

Summary of the Invention 
The present invention relates to nucleotide 
sequences (i.e., DNA and RNA) capable of encoding a 

20 polypeptide which has enzymatic activity for producing 
squalene. Such a polypeptide is referred to as. 
squalene synthetase. The DNA is isolated from a plant 
(e.g., a higher plant), in particular from a Nicotiana 
species (e.g., Nicotiana benthamiana ) ■ The nucleotide 

25 sequence encodes squalene synthetase, and variants of 
that sequence encodes enzymes exhibiting the same 
biological activity as squalene synthetase. The 
nucleotide sequence corresponds to, or substantially 
to, that sequence specified by SEQ ID NO: 1. The 

30 nucleotide sequence may be provided in an isolated, 
substantially pure form. 

In another aspect, the present invention 
relates to a polypeptide which may be provided in a 
purified (e.g., isolated and substantially pure) form. 

35 The polypeptide has enzymatic activity for producing 
squalene. That polypeptide is encoded by the 
nucleotide sequence which corresponds to, or 



wo 96/09393 




PCTAJS9S/11280 



-4- 



substantially to, that sequence specified by SEQ ID NO: 
1- The polypeptide has an amino acid sequence which 
corresponds to, or substantially to, the amino acid 
sequence specified by SEQ ID NO: 2. Polypeptides 
5 having amino acid sequences which correspond 

substantially to that encoded by the specified nucleic 
acid sequence have different amino acid sequences 
(e.g., a minor number of amino acids of the sequence 
can be deleted, added or substituted) , but the same 
10 type of biological activities as that encoded by the 
specified sequence, although those biological 
activities may differ in degree. 

In another aspect, the present invention 
relates to fragments of polypeptides (e.g., polypeptide 
15 derivatives) encoded by nucleotide sequences which 

correspond to, or substantially to, certain nucleotide 
subsequences contained within that sequence specified 
by SEQ. ID NO: 1. Such fragments represent domains of 
the full length (i.e., intact) polypeptides. Such 
20 fragments can be, for example, transit peptides useful 
for directing polypeptides to subcellular compartments, 
or polypeptide domains having properties such as 
catalytic activity, substrate binding activity, or the 
like. 

25 The nucleotide sequence of the present 

invention can be incorporated (i.e., in an operative 
fashion) into heterologous systems (e.g., yeast, 
bacteria or certain plants) in order that the 
respective polypeptide can be synthesized thereby. The 

30 nucleotide sequence can be incorporated into plants 
(e.g., rice, corn, tobacco or tomato) using 
transformation techniques or viral gene expression 
systems. The RNA molecules or polypeptides encoded by 
those nucleotide sequences can be used to alter the 

35 biosynthetic pathway of (i) sterols (e.g., 

cycloartenol, cholesterol, obtusifoliol, and related 
compounds), or (ii) other isoprenoids (e.g., squalene. 
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phytoene, lycopene, beta-carotene, and related 
compounds) • For example, the nucleotide sequence can 
be incorporated into a heterologous system in order to 
effect the accumulation in that system of various 
5 isoprenoids, including squalene- Alternatively, the 
polypeptides can be isolated in order to be used as 
enzymes in the in vitro synthesis of sterols or 
isoprenoids * 

In another aspect, the present invention 

10 relates to recombinant. DNA or RNA molecules. Such 
molecules include DNA sequences, corresponding RNA 
sequences, or subsequences of such DNA and RNA 
sequences. Certain sequences and subsequences (e.g,, 
promoters, enhancers, terminators and replication 

15 signals) are capable of facilitating the expression of 
squalene synthetase. A DNA sequence or at least one 
subsequence has a nucleotide sequence which is 
specified, or substantially specified, by at least a 
portion of that nucleotide sequence specified by SEQ ID 

20 NO: 1. The DNA sequence is isolated from a t^j-cot; j^^p^ 
\ ■ species, such as Nicotiana bent ham i ana. 

The recombinant molecule can be considered a 
plasmid or a vector. The recombinant molecule can be a 
plasmid or vector tailored for transfer of the 

25 recombinant molecule to plant or other cells. 

Recombinant molecules also can be contained in a 
transgenic plant cell, such as a tobacco plant cell. 

In another aspect, the invention relates to 
the delivery and expression of sequences or 

30 subsequences as described by SEQ ID NO: 1 via transient 
virus-based gene delivery systems. Such a system is 
described in European Patent Application No. 406,267 
and U.S. Patent No. 5,316,931, which are incorporated 
herein by reference. Such a system involves delivery 

35 of a selected nucleotide sequence as part of, or in 
conjunction with, a self -replicating DNA or RNA 
molecule (i.e., a virus), such that the exogenous gene 
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is replicated and expressed during the course of 
replication and expression of viral or virus-based 
nucleic acids and proteins. Such gene delivery systems 
may be used for expression of nucleic acid sequences 
5 or subsequences as described by SEQ ID NO: 1 in either 
sense orientation for the expression of polypeptides, 
or in ant i sense orientation for the delivery of RNA 
molecules capable of inhibiting expression of the 
target gene or other homologous genes. Genes or gene 
10 sequences delivered in such a manner are considered to 
be functionally inserted in the target organism such as 
a tobacco plant. 

A further aspect of the present invention is 
a crop composed of a plurality of plants having the 
15 nucleotide sequence of the present invention, or a 

subsec[uence thereof, functionally inserted therein; and 
planted together in an agricultural field, including a 
greenhouse or growth chamber. 

Compositions including genetic sequences and 
20 subsequences encoding enzymes for expression in plants, 
such as tobacco plants, impart those plants with the 
ability to produce altered levels of sterols and/or 
isoprenoids. As such, there is provided a method for 
altering the synthesis of sterols and isoprenoids in 
25 plants, which method involves inserting a recombinant 
genetic construct into plant cells. Such a construct 
can provide for synthesis of naturally occurring 
sterols (e.g., cycloartenol , obtusifoliol or 
cholesterol) and other isoprenoids (e.g., squalene, 
30 phytoene or beta -carotene) within such plant cells. 

For example, certain recombinant genetic constructs of 
the present invention are capable of increasing or 
decreasing expression of at least one naturally 
occurring enzyme in order that resulting transformed 
35 plants exhibit enhanced ability to produce a sterol or 
a carotenoid. As such, there is provided a method for 
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alteririg the biosynthetic pathway of various components 
in plants . 

In yet another aspect, the present invention 
relates to an antisense sequence for that sequence 
5 which corresponds to, or substantially to, the totality 
or a subset of that nucleic acid sequence specified by 
SEQ ID NO: 1- As such, that sequence encodes RNA 
molecules capable of inhibiting expression of the 
corresponding gene and genes related thereto. Antisense 

10 RNA is capable of reducing the expression of genes 
within plant cells. See, Ecker et al . , Prpc, Wa^^r 
Acad. Sci. USA , Vol. 83, p- 5372, (1986) and Rothstein 
et al ,. Proc, Nat. Acad> Sc i . USA, Vol. 84, p. 8439 
(1987) . The nucleotide sequence can be incorporated 

15 into plants (e.g., tobacco or tomato) using 

transformation techniques or viral gene expression 
systems. As a result, the present invention provides 
for a method for altering the synthetic pathway of 
sterols and isoprenoids. For example, the expression 

20 of an antisense molecule can be useful for suppressing 
or preventing synthesis of sterols (e .g., to cause a 
flux of total carbon available for biosynthesis of 
carotenoid compounds) or for preventing synthesis of 
certain isoprenoid compounds. Alternatively, 

25 expression of an antisense molecule may be useful in 
accumulating relatively high levels of certain 
molecules upstream of a particular metabolic block. 
Another example is the use of such antisense molecules 
to deliberately direct metabolites toward one branch of 

30 a branched pathway, such as that pathway involved in 
carotenoid biosynthesis . 

Byief De scription of the Drawings 
Figures l and 2 illustrate maps of plasmids 
35 pBGC802 and pBGC803, respectively, which each contain 
the WicoCiana Jbenthamiana squalene synthetase gene 



wo 96/09393 




PCTAJS9S/11280 



-8- 



cloned in as a PGR fragment into plasmid pCRII 
(Invitrogen, San Diego, CA) . 

Figure 3 illustrates a map of the plasmid 
pBGC804 containing the Nicotiana henthamiana squalene 
5 synthetase gene cloned as an Ndel/Sall fragment into 

the expression vector pT7-7. 

Figure 4 illustrates a map of the 
transcription plasmid pBGCBOS containing a 638 bp 
ant i sense fragment of Nicotiana benthsuniana squalene 
10 synthetase cloned into the Xhol and Avrll sites of the 
RNA viral vector TTOIA. 



Detailed Description o f the Preferred Embodiments 

The nucleotide sequences of genetic materials 

15 of the present invention are isolated from a plant, and 
preferably from a Nicotiana species. A plant is an 
organism belonging to the kingdom Plantae, See, Raven, 
et al, . p. 178 (1981) . Most preferably, the nucleotide 
sequences are isolated from Nicotiana benthamiana . 

20 Examples of other Nicotiana species include Nicotiana 
debnevi . Nicotiana alauca, Nicotiana alutinosa . 
Nicotiana rustica > Nicotiana svaveolens and Nicotiana 
tabacum . Examples of cultivars of Nicotiana tabacum 
include flue-cured tobacco (e.g., NK 326), Burley 

25 tobacco (e.g,, KY 14) and Maryland tobacco (e.g., MD 
609) . 

The nucleotide sequence of isolated genetic 
material of the present invention can be obtained by a 
variety of techniques. The sequence can be obtained by 

30 sequencing non-vector nucleotide sequences of 

recombinant molecules. Nucleotide sequence information 
can be obtained by employing widely used DNA sequencing 
protocols. Examples of suitable nucleotide sequencing 
protocols can be found in Berger and Kimmel, Methods in 

35 Enzvmoloay Vol. 51, Guide to Molecular Cloning 
Techniques . Academic Press (1987) . Nucleotide 
sequence information from several recombinant DNA 
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isolates, including isolates from both cDNA and genomic 
libraries, can be combined so as to provide the entire 
amino acid coding sequence, as well as the nucleotide 
sequences of upstream and downstream nucleotide 
5 sequences. 

For gene isolation, mRNA is converted into 
cDNA^ and the resulting cDNA is cloned. The cloned 
cDNA then can be used directly, or it, or sequences 
derived from it, can be utilized for acquiring the 

10 entire gene, either (i) from a library (e.g., in a 

lambda or plasmid vector) using sequence information to 
screen the library and detect the desired clone, or 
(ii) by amplification with PGR (polymerase chain 
reaction) and subsequent cloning into a suitable 

15 vector. For example, the 5' and 3' RACE (Rapid 

Amplification of cDNA Ends) reactions can be used to 
clone overlapping 5' and 3' ends of the gene of 
interest with subsequent assembly of the complete gene. 

Nucleotide sequences obtained from sequencing 

20 specific genetic library isolates can be subjected to 
' further analysis in order to identify regions of 
interest in the genetic material. These regions of 
interest include additional open reading frames, 
promoter sequences, termination sequences, and the 

25 like. 

Isolated DNA can be characterized as being 
selected from the group consisting of: 

(a) Isolated DNA consisting of DNA 
having the nucleotide sequence 

30 which corresponds to, or 

substantially to, SEQ ID NO: 1. 

(b) Isolated DNA which hybridizes to isolated 
DNA of (a) above which encodes an enzyme or fragment 
thereof having squalene biosynthetic activity. 

35 Hybridization of such sequences may, for example, be 
carried out under stringent conditions (e.g., 
conditions represented by a wash stringency of O.lx SSC 
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(ix SSC is 0,3M NaCl, 0.03M sodium citrate), 0.1% SDS 
at 60**C to DNA of (a) above) in a standard in situ 
hybridization assay. See J. Sambrook et al . , Molecular 
Cloning: A Laboratory Manual (2d Kd.) . Cold Spring 
5 Harbor Laboratory, Cold Spring Harbor, N.Y. (1989). In 
general, such sequences will be at least 95% 
homologous, often at least 98% homologous, and even at 
least 99% homologous with the sequences of (a) above. 

(c) Isolated ONA homologous to isolated DNA 

10 of (a) and (b) above. Homology relates to substantial 
or complete identity of nucleic acid sequences. Two 
nucleic acid fragments are homologous if they are 
capable of hybridizing to one another under 
hybridization conditions described in Maniatis et al . . 

15 Molecular Cloning; A Laboratory Manual. Cold Spring 

Harbor Laboratory, Cold Spring Harbor, N.Y., pp. 324- 
325 (1982) . Homologous sequences can be identified 
that contain less than about 5% base pair mismatches by 
using the following wash conditions: 2 x SSC, 0.1% SDS, 

20 room temperature twice, 30 minutes each; then 2 x SSC, 
room temperature twice, 10 minutes each. Homology can 
be assessed using computer programs such as DNASIS^ and 
PCGene (LKB, Hitachi Corporation, Japan, and 
Intelligenetics, Palo Alto, CA. ) whereby the degree of 

25 homology is within the limits of homology considered 
significant by Bost et al . . Biochem . Biophvs . Res . 
Commun , . Vol. 128, pp. 1373-1380 (1985). More 
preferably, homologous nucleic acid strands contain 
less than 2% base pair mismatches, even more preferably 

30 less than 1% base pair mismatches. These degrees of 
homology can be selected by using more stringent wash 
conditions for identification of clones from gene 
libraries (or other sources of genetic material) , as is 
well known in the art. 

35 (d) Isolated DNA differing from the isolated 

DNA of (a) , (b) and (c) above in nucleotide sequence 
due to the degeneracy of the genetic code, and which 
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encodes either (i) a polypeptide identical to the 
polypeptide specified by SEQ ID NO: 2, or (ii) an 
enzyme or fragment thereof having sterol or isoprenoid 
biosynthetic activity. 



5 



There is about 43 to about 4 9 percent 



conservation between the nucleotide coding region 
specified by SEQ ID NO: 1 and those squalene synthetase 
genes set forth in Jennings, et al . , Proc . Nat . Acad . 
Sci. USA , Vol. 88, p. 6038 (1991), McKenzie, et al . , J. 

10 Biol. Chem, , Vol, 267, p. 21368 (1992) and Robinson, £t 
alw MQl. Cell, Biol,, Vol. 13, p. 2706 (1993); as 
determined by PC/GENE Multiple Sequence Alignment 
program (IntelliGenetics, Inc., Mountain View, CA) . 
With regards to the polypeptide encoded by the nucleic 

15 acid sequence specified by SEQ ID NO: 1, there is a 
21.8 percent identity and 24.6 percent similarity 
between the amino acid sequence specified by SEQ ID NO: 
2 and the squalene synthetase sequences set forth in 
Jennings, et al . . Proc . Nat . Acad , Sci. USA . Vol. 88, 

20 p. 6038 (1991), McKenzie, et al . , J. Biol. Chem .. Vol. 

267, -p. 21368^ (1992) and Robinson, et al , . Mol . Cell, v.. 
Biol . , Vol. 13, p. 2706 (1993). However, there is 90 
percent homology between the conserved residues found . 
in known scfualene synthetase amino acid sequences and: 

25 those regions of homology found in the amino acid 

sequence specified by SEQ ID NO: 2. For the sequence 
specified by SEQ ID NO: 2, the conserved regions of 
amino acids are in positons 1, 40, 44, 48^52, 54, 55, 
58, 63, 66, 69-71, 73, 74, 77, 78, 80-82, 90, 94, 97, 

30 106, 108, 116i 122, 132, 136, 140, 147, 148, 150-152, 

168-177, 179, 180, 200-214, 216, 217, 225, 227-229, 
231, 233, 236, 248, 255, 262-264, 266, 267, 269, 271- 
273, 283, 285-294, 296-298, 303, 306, 311, 312, 314, 
316, 350, 351. 

35 The nucleotide sequences of the present 

invention are capable of encoding polypeptides having 
enzymatic activity for sterol biosynthesis and 
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isoprenoid biosynthesis. Such polypeptides have 
domains of amino acid sequences having catalytic 
activity. The catalytic domains demonstrate enzymatic 
activity, can be isolated, and can be expressed in in 
5 vivo or in vitro systems . 

A polypeptide derivative of a particular 
biosynthetic enzyme can differ in length from the 
natural enzyme, but typically contains numerous amino 
acids from the natural enzyme in the same primary order 

10 as found in that enzyme as obtained from a natural 

source. Such a polypeptide molecule' has substantially 
the same full length amino acid sequence as the natural 
enzyme but possesses minor amino acid substitutions 
that do not substantially affect the ability of that 

15 derivative to cause biosynthesis of sterols or 

isoprenoids. Derivatives include glycosylated forms, 
aggregative conjugates with other enzyme molecules and 
covalent conjugates with unrelated chemical moieties. 
Covalent derivatives are prepared by linkage of 

20 functionalities to groups which are found in the enzyme 

amino acid chain or at the N- or terminal residue by 
means known in the art . 

Isolated nucleotide sequences encoding 
biosynthetic enzyme can be used to produce purified 

25 enzyme or derivatives thereof by either recombinant DNA 
methodology or by in vitro polypeptide synthesis 
techniques. Purified and isolated polypeptide or 
nucleotide sequences are present in the substantial 
absence of other biological macromolecules of the same 

30 type. Purified genes and polypeptides of the present 
invention typically have at least 95% by weight, more 
preferably at least 99% by weight, and most preferably 
at least 99.8% by weight, of biological macromolecules 
of the same type present; but water, buffers, and other 

35 small molecules, especially molecules having a 

molecular weight of less than 1000, can be present. 
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Biosynthetic enzymes and polypeptide 
derivatives of those enzymes can be expressed by 
recombinant techniques when a DNA sequence encoding the 
relevant molecule is functionally inserted into a 
5 vector (e.g., in proper reading frame and orientation, 
as is well understood by those skilled in the art) . 
Typically, the relevant gene will be inserted 
downstream from a promoter and will be followed by a 
stop codon, although production as a hybrid protein 

10 followed by cleavage may be used, if desired. In 

general, host-cell-specific sequences improving the 
production yield of enzyme and enzyme derivatives can 
be used, and appropriate control sequences (e.g., 
enhancer sequences, polyadenylatxon sequences, and 

15 ribosome binding sites) can be added to the expression 
vector. 

A genetic construct can be prepared and used 
to transform plant cells- The transformed plant cells 
may be cells in culture, may be present as a 

20 disorganized mass in callus, leaf explants, or shoot 

.^,^cr. - .cult^^ a post -transformation ji 

differentiated plant or plant part, such as seeds, 
leaves, roots, or the like. The foreign construct 
normally is present in all or substantially all of the 

25 cells of the plant tissue, but expression may be 

limited to particular cells or particular times in the 
development of the plant. The foreign construct 
normally includes transcriptional and translational 
initiation and termination signals, with the initiation 

30 signals 5' to the gene of interest and the termination 
signals 3' to the gene of interest. 

The transcriptional initiation region which 
includes an RNA polymerase binding site (i.e., 
promoter) may be native to the host or may be derived 

35 from an alternative source, where the region is 
functional in the plant host. 
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The transcriptional initiation regions may 
include, in addition to the RNA polymerase binding 
site, regions providing for regulation of 
transcription. The 3' termination region may be 
5 derived from the same gene as the transcriptional 
initiation region or from a different gene. For 
example, where the gene of interest has a 
transcriptional termination region functional in the 
host species, that region may be retained with the 
10 gene. 

An exemplary expression cassette can be 
constructed to include a) a transcriptional initiation 
region, b) the biosynthetic enzyme gene under the 
transcriptional regulational control of the 

15 transcriptional initiation region, c) the translational 

initiation codon, d) the coding secpaence of the gene, 
with or without introns, and e) the translational stop 
codons, followed by f) the transcriptional termination 
region. The transcriptional termination region 

20 includes the terminator, and may include a 

polyadenylation signal sequence, and other sequences 
associated with transcriptional termination. The 
direction is 5 '-3' in the direction of transcription. 

When the expression product of the gene is to 

25 be located in a subcellular or extracellular 

compartment other than the cytoplasm, the gene usually 
is constructed to include particular amino acid 
sequences which result in translocation of the product 
to a particular site. That site may be an organelle, 

30 such as the chloroplast, mitochondrion, endoplasmic 

reticulum or nucleus, the cell plasma membrane, or may 
be secreted into the periplasmic space or into the 
external environment of the cell. Various secretory 
leaders, membrane integrator sequences, and 

35 translocation sequences for directing the peptide 

expression product to a particular site are described 
in the literature. See, for example, Cashmore al . . 
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Biotechnolocfv , Vol. 3, pp. 803-808 (1985), and Wickner 
and Lodish, Science , Vol. 230, pp. 400-407 (1985). 

. The expression cassette normally is carried 
on a vector having at least one replication system. 
5 For convenience, it is common to have a replication 

system function in E. coli such as ColEl, pSClOl, pA- 
Cyci84, or the like. In this manner, at each stage 
after each manipulation, the resulting construct may be 
cloned, sequenced, and the correctness of the 

10 manipulation determined. 

In addition to the replication system, there 
frequently is at least one marker present, which may be 
useful in one. or more hosts, or different markers for 
individual hosts. That is, one marker may be employed 

15 for. selection in a prokaryotic host, while another 

marker may be employed for selection in a eukaryotic 
host,, particularly the plant species host. The markers 
may be protection against a biocide, such as 
antibiotics, toxins, heavy metals, or the like; or 

20 complementation, imparting prototropy to an auxotrophic 

.1"',, ...host :v ^ ... ^ " ..f. 

The various fragments comprising the various 
constructs, expression cassettes, markers, and the like 
may be introduced consecutively by restriction enzyme 

25 cleavage of an appropriate replication system, and 

insertion of the particular construct or fragment into 
the available sites. After ligation and cloning, the 
vector may be isolated for further manipulation. All 
of these techniques are amply exemplified in the 

30 literature and find particular exemplification in 
Sambrook e£, si* « Molecular Cloning; A Laboratory 
Manual . Cold Spring Harbor Laboratory, Cold. Spring 
Harbor, N.Y., (1982). 

Once the vector is completed, the vector may 

35 be introduced into plant cells. Techniques for 
transforming plant cells include microinjection, 
particle bombardment, direct DNA update, such as using 
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polyethylene glycol, electroporation, viral infection, 
and transformation with Aarobacter ium . For 
representative techniques, see, D'Halluin et al , , The 
Plant g^ll. Vol. 4, p. 1495 (1992); Tomes et al . , Plant 
5 nol, Bipl., Vol. 14, p. 261 (1990); and Zhu et al , , 

Plant Cell, Tissue and Organ CuImitp^ Vol. 22, p. 145 
(1991) . 

The nucleotide seqpaence encoding a given 
enzyme can be functionally inserted into plants or 
10 transiently expressed by vijrus-based gene delivery 

systems. Such a system is described in European Patent 
Application Nos. 67,553, 194,809 and 406,267, PCT WO 
93/20217 and U.S. Patent Nos. 5,304,731 and 5,316,931; 
which are incorporated herein by reference. Such a 

15 system involves delivery of a selected nucleotide 

sequence as part of, or in conjunction with, a self- 
replicating DNA or RNA molecule (e.g., a virus), such 
that the exogenous gene is replicated and expressed 
during the course of replication and expression of 

20 viral or virus-based nucleic acids and proteins. Such 
gene delivery systems, in addition to enhancing gene 
"copy number" through the replicative potential of the 
given virus or virus-based nucleic acids, facilitate 
the timed delivery of such exogenous genes at the 

25 desired state of host development. 

An exemplary virus -based gene delivery system 
employs tobacco mosaic virus. A DNA copy of the virus 
has a DNA sequence of the present invention inserted 
therein using conventional techniques of molecular 

3 0 cloning. The cloned DNA copy of the resulting viral 
vector then is transcribed to produce an RNA. The 
resulting RNA transcript vector then can be used to 
inoculate a grown Nicotiana benthamxana or Nicotiana 
tobacxm plant by applying that RNA transcript onto a 

3 5 carborundum- dusted leaf in the direction of the leaf 

tip with a gloved finger. The plant so inoculated and 
infected with the virus vector is allowed to continue 
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growing for about 3 weeks and then is harvested. The 
harvested plant can be extracted immediately or frozen 
for storage purposes . This infected plant material , or 
the vector virions isolated from it/ can then be used 
5 for the inoculation of a multiplicity of plants for 

purposes of significantly amplifying the delivered gene 
or gene product. In order to accomplish this, the leaf 
tissue so collected is mixed with a lOmM phosphate 
buffer (pH 7.5) at a ratio of about 1:10 (w/v) . The 

10 mixture is macerated using a high speed blender, and 
centrifuged so as to obtain a liquid inoculum. The 
inoculum then can be applied to a growing plant, such 
as a tobacco plant, by applying the inoculum to injured 
(e.g., lacerated) regions of that plant. For example, 

15 the upper most leaves of the growing plant can be cut 
using an inoculum-wetted cutting blade (e.g., by spray 
nozzles located in the cutting blade); or the inoculum 
can be sprayed onto a specific region of one leaf of 
the plant as a high pressure spray. The plant is 

20 allowed to continue growing in order that the viral 

vector can systemically infect the plant. At a desired 
stage of infection, the plant is harvested. 

The following examples are provided in order 
to further illustrate the invention but should not be 

25 construed as limiting the scope thereof. Unless 
otherwise noted, all parts and percentages are by 
weight. 

Example 1 

30 Isolation of a gene encoding sgualene 

synthetase was carried out as follows: 

A. Isolation of RNA from Nicotiana 
b(gpt^^mian^ : 

About 100 mg of plant leaf tissue was 
35 harvested, transfered to a 1.5 ml microfuge tube, 

frozen with liquid nitrogen, and homogenized in one ml 
of AGRIsol (Biogentex, Houston, TX) using a teflon 
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pestle connected to a Moto Tool™ (Dremel Corp. , 
Racine, WI) , Immediately following homogenization, 100 
til of chloroform was introduced into the tube. The 
mixture was shaken by hand for 20 sec. Then, the 
5 mixture was incubated for 5 min. on ice, and 

centrifuged for 10 min. at 10,000 x g at 4*C. The 
aqueous phase of the centrifuged mixture was 
transferred to a separate tube, and the RNA was 
precipitated from that phase by addition of one volume 

10 of isopropanol. After incubation on ice for 5 min. 

that sample was centrifuged for 10 min. at 10,000 x g 
at 4*C. The resulting pellet was collected and washed 
twice with ethanol (75% in water) . The pellet was 
air-dried for about 15 min. The pellet was resuspended 

15 in 20 ill RNase-free dHjO. The RNA concentration was 
calculated by determining the ODa^o nm (1 ODa^o = 40 
Mg/ml) . 

B. Isolation of an internal cDNA fragment of 
sgualene synthetase . 

20 The literature was examined for nucleotide or 

amino acid sequences from human and yeast encoding the 
gene for squalene synthetase. Regions of conserved 
amino acid sequence were used to design suitable 
degenerate oligonucleotide primers. See Robinson et 

25 al . . Molecular and Cellular Biology . Vol. 13, p. 2706 

(May 1993) . Primers were synthesized from these 
sec[uences on an Applied Biosystems Model 391 PCR Mate, 
or were obtained from commercial sources. 

Approximately 150 ng of total RNA was mixed 

30 with 10 pg of primer SSP-1, (5'-GGC TCG AGA TIG CCA TIA 

CYT GNG GNA TNG CXC AXA A-3' , where X is G or A, Y is C 
or T, N is G, A, C, or T, and I is inosine) and dried 
in a lyophilizer. Five microliters of a 2.5 mM dNTP 
mix (dATP, dCTP, dGTP, and dTTP) was added to the dried 

35 RNA*primer pellet, heated 5 min at 70«*C, and cooled to 
37<>C. The reaction mixture was made up to 10 til with 
final concentrations of 50 mM Tris-HCl, pH 8.3, 50 mM 
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KCl, 5 inM MgCl2, 0.5 mM spermidine, and 10 mM DTT. 
Five units of avian myeloblastosis virus (AMV) reverse 
transcriptase (Promega, Madison, WI) were added and the 
reaction was incubated for 2 hr at 37^C. 
5 Following first strand cDNA synthesis, 10 pg 

of primer SSP-2, (S' -GGC TCG AGT AYT GYC AYT AYG TIG 
CNG GIC TNG TNG G-3' ) , where X is G or A, Y is C or T, 
N is G, A, C, or T, and I is inosine) , was added along 
with 5 fil of GeneAmp® lOX PGR buffer (500 mM KCl , 100 

10 mM Tris, pH 8.3, 15 mM MgCl^, and 0.01% (w/v) gelatin) 
(Perkin-Elmer Cetus, Norwalk, CT) , 30 /xl distilled, 
deionized water, and 5 units AmpliTag® DNA polymerase 
(Perkin-Elmer Cetus) . PGR amplification was performed 
using a DNA Thermal Cycler (Perkin-Elmer Cetus) with 1 

15 cycle at 95®C for 5 min, 35 cycles of 50**C for 2 min, 

72®C for 2 min, and 95«C for 2 min, and 1 cycle of 72®C 
for 9 min. The amplified products were re -amplified in 
a 50 pil reaction volume with final concentrations of 50 
mM KCl, 20 mM Tris-HCl, pH 8.3, 1.5 mM MgCl^, 0.01% 

20 (w/v) gelatin, 0.5 mM each of dATP, dGTP, dCTP, dTTP, 

. and 5 units AmpliTaq® DNA polymerase . The ,re~amplif i.ed 
.products were analyzed by separation on a 1% SeaKem® t 
agarose (FMC BioProducts, Rockland, ME) gel in TAE (Ix 
TAE Buffer is 4 0 mM Tris -Acetate, 2 mM EDTA) by gel 

25 electrophoresis, and a band corresponding to the 
predicted size of 4 00 bp was excised and the DNA 
purified from the gel fragment using Geneclean"* (Bio 
101, LaJolla, CA) . 

The isolated 400 bp PCR fragment was cloned 

30 into plasmid pCRII (Invitrogen Corporation, San Diego, 
CA) . Clones containing the 400 bp insert were subjected 
to DNA sequence analysis using the Sequenase® Version 
2.0 DNA Sequencing Kit (United States Biochemical 
Corporation, Cleveland, OH) . A representative clone, 

35 designated pSS400, was used for subsequent analysis and 
manipulation . 
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DNA sequence was determined by separation of 
the fragments on a polyacrylamide/urea gel (BioRad, 
Richmond, CA) , drying the gel on a sheet of Whatman 3MM 
paper (Whatman International Ltd., Maidstone England) 
5 using a BioRad Model 583 gel dryer. The dried gel was 
exposed to X-ray film, and the DNA sequence was read. 
The sequence was examined for homology with the 
published squalene synthetase sequences. The deduced 
amino acid sequence encoded by the Nicotiana 

10 bent hami ana squalene synthetase cDNA fragment was found 
to be 92% homologous to the conserved regions of amino 
acid sequence reported. See Robinson et al . . Molecular 
and Cellular Biology . Vol. 13, p. 2706 (May 1993). All 
non- identical amino acids in these regions were 

15 conserved substitutions. 

C. 3' RACE amplification of squalene 
synthetase . 

Sequence data generated from the 4 00 bp 
internal squalene synthetase cDNA fragment were used to 

20 design additional primers to facilitate the isolation 
of the complete cDNA. These were synthesized on an 
Applied Biosystems Model 391 PCR Mate, or were obtained 
from commercial sources. 

Approximately 150 ng of total RNA was heated 

25 with 10 pg of Adapter Primer (Gibco BRL, 5'-GGC CAC GCG 
TCG ACT AGT AC(T)x7-3') for 10 min. at 65®C and then 
chilled 2 min on ice. The reaction mixture was made up 
to 20 /il total volume with final concentrations of 20 
mM Tris-HCl (pH 8.4), 50 mM KCl, 2.5 mM MgClj, 100 fig/ml 

30 BSA, 10 mM DTT, 500 nM Adapter Primer and 0.5 mM each 
dATP, dCTP, dGTP and dTTP. The mixture was 
equilibrated 2 min at 42«C. 200 units of Superscript"" 
reverse transcriptase were added and the mixture 
inctibated 30 min. at 42«*C. Two units of E. coli RNase 

35 H was added and the mixture incubated 10 min at 42*'C to 
yield the first strand cDNA. 
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The first strand cDNA was directly amplified 
by the PCR method. A 50 ^1 reaction was assembled with 
a final composition of 20 mM Tris-HCl (pH 8.4), 50 mM 
KCl, 2.5 mM , ICQ /ig/ml BSA, 200 nM primer SSP-3 (5'- 
5 TCC ATG GGT TTA TTT CTT CAG -3') 200 nM Universal 

Amplification Primer (Gibco BRL, 5'-CUA CUA CUA CUA GGC 
CAC GCQ TCG ACT AGT AC -3'), 0.2 mM each dATP, dCTP, 
dGTP and dTTP, and 5 units AmpliTaq® DNA Polymerase 
(Perkin-Elmer Cetus) . Amplification was in a DNA 

10 Thermal Cycler with 1 cycle of 5 min. at 94**C, 35 

cycles of 2 min at 94<>C, 2 rain at 55**C, 2 min at 72«»C, 
and 1 cycle of 5 min. at 72«C. An aliquot of the 
amplified PCR products was used directly in a nested 
amplification performed in a 50 ^ll reaction with a 

15 final composition of 10 mM Tris-HCl (pH 8.3), 50 mM 

KCl, 1.5 mM , 0.001% (w/v) gelatin, 0.2 mM each dATP, 
dCTP, dGTP and dTTP, 200 nM Universal Amplification 
Primer, 200 nM primer SSP-4 (5'-CAA TGT CTA AAT GAC ATG 
GTC ACT AAT GC- 3'), 1 Ml PCR products, and 2.5 units 

20 AmpliTaq® DNA Polymerase. Amplification was performed 
' - in a DNA Thermal Cycler with 1- cycle, of : 5 min. at 94*=*.C,r 
35 cycles of 2 min at 94^C, 2 min at 55**C, 2 min at 
72^C, and 1 cycle of 5 min. at 72°C. 

D. Southern Analysis of PCR Products: 

25 Nested PCR 3' RACE products were analyzed by 

separation on a 1% SeaKem® agarose (FMC BioProducts, 
Rockland, ME) gel in TAE buffer. The PCR products in 
the gel were denatured by treatment for 20 min. in 1.5 
M NaCl, 0.5 N NaOH and neutralized by soaking in 

30 several volumes of 1.5 M NaCl, 1 M Tris-HCl (pH 8.0). 
The PCR products were transferred to a Flash® nylon 
membrane (Stratagene Cloning Systems, La Jolla, CA) 
using a PosiBlot® pressure blotter (Stratagene) and 
were UV*crossl inked to the membrane using a 

35 Stratalinker® 2400. 

Hybridization and detection of scjualene 
synthetase homologous sequences were performed using 
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the chemilurainescent Genius® system (Boehringer 
Mannheim Biochemicals, Indianapolis, IN) . The membrane 
was prehybridized for 1 hr. at 60**C in 10 ml 
hybridization solution (5X SSC, IX Blocking solution, 
5 0.1% N-laurylsarcosinate, 0.02% sodium dodecyl sulfate 
(SDS) ; lOX Blocking solution is 10% (w/v) Blocking 
Reagent (Boehringer Mannheim Biochemicals, 
Indianapolis, IN) in 100 mM maleic acid at pH 7.5, 150 
mM NaCl) . Incubations were in a Model 1040-01-0 

10 rotating bottle hybridization incubator (Robbins 

Scientific, Sunnyvale, CA) . Heat -denatured (5 min. at 
100 'O probe was added to the prehybridization mixture 
at a final concentration of 1.25 ng/ml, and incubation 
of the resulting mixture was continued 4-16 hr. at 

15 60*C. The membrane was washed twice for 5 min. in 2X 
SSC, 0.1% SDS at room temperature, and twice for 15 
min. in O.IX SSC, 0.1% SDS at 60*>C. For 
chemiluminescent detection, the membrane was rinsed in 
100 mM maleic acid, 150 mM NaCl, and incubated for 30 

20 min. at room temperature in IX Blocking solution. 

Alkaline phosphatase-conjugated anti-digoxigenin Fab 
fragments were added (1:10,000 dilution), and 
incubation was continued for 30 min. at room 
temperature. The membrane was washed twice for 15 min. 

25 in 100 mM maleic acid, 150 mM NaCl, at room 

temperature. The membrane was equilibrated for 2 min. 
in 100 mM Tris-HCl (pH 9.5), 100 mM NaCl, 50 mM MgClj. 
The membrane was placed on a sheet of clear acetate 
film and Lumi-Phos® 530 (Boehringer Mannheim 

30 Biochemicals, Indianapolis, IN) was added on the DNA 
side of the membrane. Another sheet of acetate film 
was used to cover the membrane, and the membrane was 
exposed to X-ray film. Positive signals on the X-ray 
film indicated the presence of PCR products containing 

35 scjualene synthetase homologous sequence. 

Chemiluminescent probe for the hybridization 
procedure was generated by labeling the 400 bp insert 
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seguences of plasmid pSS400. One fig of plasmid pSS400 
was digested with EcoRI to excise the insert . The 
scpialene synthetase cDNA fragment was separated on an 
agarose gel and excised after visualization by ethidium 
5 bromide staining. The DNA was purified from the gel 

fragment using Geneclean™ and resuspended in 14 /il dHjO. 
The DNA was heat -denatured 10 min at 100°C and chilled 
on ice. The Genius® Kit 2 (Boehringer Mannheim 
Biochemicals) was used to label the fragment with 
10 digoxigenin-dUTP according to the manufacturer's 

protocols. The probe concentration was determined by 
comparison with a dilution series of a known standard 
on a Flash® membrane, detected by chemiluminescence as 
above. 

15 E. Cloning of the 3' RACE products. 

The nested PCR 3' RACE products of squalene 
synthetase were cloned using the pCRII vector. Clones 
were screened by digestion with EcoRI to liberate the 
insert, separation on an agarose gel, transfer to a 

20 nylon membrane, and hybridization and detection as 
w described above. Clones exhibiting a positive-- 

hybridization signal were subjected to DNA sequence 
analysis as described above. The sequence was examined 
for homology with the published squalene synthetase 

25 sequences. Clones were identified that exhibited at 
least 90% homology to the conserved regions in other 
squalene synthetase genes. 

F. 5' RACE amplification and cloning of 
squalene synthetase. 

30 The 5' RACE procedure (Frohman et al . , Proc. 

Nat. Acad, Sci, USA . Vol. 85, pp. 8998-9002 (1988)) was 
used to amplify the 5' terminus of squalene synthetase 
mRNA. 

Four to six fig total Nicotiana benthamiana 
35 RNA was combined with 10 pmole of primer SSP- 5 (5'- 

GTA AGT CAA ACA ATC TTC TAC ATG TGA-3') and the mix was 
heated for 10 min at lO^C and chilled on ice. The 
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reaction was brought to a volume of 20 pil in 2 0 mM 
Tris-HCl (pH 8.4, 50 mM KCl , 2 . 5 mM MgClj, 0.1 mg/ml 
BSA, 10 mM DTT, 0.2 mM each dATP, dCTP, dGTP and dTTP, 
and 200 units reverse transcriptase. The mixture was 
5 incubated for 10 min at room temperature and 50 min at 
42**C. The reaction was stopped by heating for 5 min at 
90*^C and chilled on ice. Two units E. coli RNAse H 
were added and the reaction incubated 20 min at 37^0. 
The single-stranded cDNA was purified from the previous 

10 reaction using Geneclean"*, resuspended in 16 /xl dH20, 
heated 10 min at 70**C and chilled on ice. The tailing 
reaction was initiated by adding 1 fil of lOX reaction 
buffer (200 mM Tris-HCl, (pH 8.4), 500 mM KCl, 25 mM 
MgClj, 1 mg/ml BSA) , 2 /xl 2 mM CTP, and 15 units of 

15 terminal transferase (TdT) to the single -stranded cDNA. 
After 10 min at 37^C, the TdT was inactivated for 10 
min at 70®C and the reaction chilled on ice. 

Ten nl of the dC-tailed cDNA was used in the 
amplification step. The volume of the reaction was 

20 brought to 100 ^il by the addition of 76.5 fil dHjO, 9.5 
fil lOX reaction buffer, 0.2 mM each dATP, dCTP, dGTP 
and dTTP, 10 pmole primer SSP-6 (5'-CTG AAG AAA TAA ACC 
CAT GGA -3'), 10 pmole Anchor primer (Gibco BRL, 5'-CUA 
CUA CUA CUA GGC CAC GCG TCG ACT AGT ACG GGI IGG GII GGG 

25 IIG-3') and 5 units Tag DNA polymerase (Gibco BRL) . 

The amplification was performed with one cycle of 2 min 
at 94»C, 37 cycles of 30 sec at 94*»C, 15 sec at 55*>C, 
40 sec at 72*>C, and one cycle at 72«C for 5 min. The 
amplification products were analyzed by agarose gel 

30 electrophoresis and a DNA fragment of approximately 800 
bp was visualized after ethidium bromide staining. 

One microliter of the first reaction was 
amplified by PCR using the Universal Amplification 
Primer (UAP) (Gibco BRL, 5'- CUA CUA CUA CUA GGC CAC 

35 GCG TCG ACT AGT AC -3') . The reaction was composed of 
80.5 Ml dHjO, 10 111 lOX Tag polymerase buffer (Gibco 
BRIi) , 5 Ml 50 mM MgClj, 1 Ml of the DNA Polymerization 
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Mix (Pharmacia LKB Biotechnology, Piscataway, NJ) , l /il 
of the first amplification reaction, and 2.5 units Tag 
polymerase. The amplification was performed with one 
cycle of 2 min at 94**C, 30 cycles of 40 sec at 94^C, 30 
5 sec at 55**C, 1 min at 72**C, and one cycle at 72 ®C for 5 
min. The amplified products were purified using a 
Magic PCR Prep® column (Promega, Madison, WI) as 
recommended by the manufacturer followed by agarose gel 
electrophoresis. The resulting 800 bpDNA fragment was 

10 excised from the agarose and purified using Geneclean*". 
The fragment was cloned into a pCRII vector using 
standard cloning technicjues. The clones were subjected 
to DNA sequence analysis using the Seguenase Version 
2 . 0 DNA Sequencing Kit (United States Biochemical 

15 Corporation, Cleveland, OH) . The sequence was examined 
for homology with the published squalene synthetase 
sequences. Clones were detected that exhibited at 
least 90% homology to the conserved regions in other 
squalene synthetase genes. 

20 G. Cloning of Squalene Synthetase. 

. A region of the scpialene synthetase .v 

encompassing the start codon and the stop codon was 
amplified using the PCR as follows: Total RNA from 500 
mg of fresh leaves of Nicotieuia Jben thamiana was 

25 extracted with Trizol™ RNA extraction kit (Gibco BRL) . 
The first DNA strand was synthesized by a reverse 
transcription reaction using 6 fig of total RNA as a 
template, oligo (dT) ^,.^8 as primer (0.5 Mg) f and the 
Superscript™ preamplif ication kit (Gibco BRL) . The 

30 RNA-primer mix was brought to a total volume of 14 /il, 
heated for 10 min at 70**C, then chilled on ice. The 
reaction volume was adjusted to 20 fil by adding 2^1 of 
lOX synthesis buffer (200 mM Tris-HCl (pH 8.4) , 500 mM 
KCl, 25 mM MgCl,, 1 mg/ml BSA) , 2;xl of 0 . 1 M DTT, 1 fil 

35 of 10 mM dNTP mix and 1 /il (200 units) of reverse 

transcriptase. The mixture was incubated for 10 min at 
room temperature and 50 min at 42^C. The reaction was 
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stopped by heating 5 min at 90®C and chilled on ice. 
Then, 1 ^ll of E. coli RNAse H (two units) was added and 
the mixture incubated 20 rain at 31 ^C, 

For the polymerase chain reaction (PGR) step, 
5 two primers (10 pmole of each) were added (SSP-7 (5'- 
CAA CGC TTT CTC CTC TTA CCA -3') and SSP-8 (5'-CGT CTA 
AGA TCG GTT TCC C3GA -3')) and the reaction mix volume 
increased to 100 ^1 with 8 fil of 10 X synthesis buffer, 
0.5 fil of Tag DNA polymerase and water. PCR was 

10 performed using 94'»C for 40 sec, 55*»C for 20 sec and 

72*»C for 1 min for 35 cycles, and 94*»C for 40 sec, 55**C 
for 20 sec and 72®C for 6 min for one cycle. 

The PCR products were analyzed by agarose gel 
electrophoresis. A DNA fragment of the expected size 

15 of 1275 bp was excised from the agarose gel, and 

purified using Geneclean™. The PCR product was then 
ligated to the pCR II using the manufacturer's 
recommendations. Following transformation, fourteen 
colonies were analyzed by plasmid DNA isolation and 

20 digestion with JSco RI . Plasmid DNA from three 

independant clones containing the expected 1275 bp 
insert were purified using the Wizard midiprep 
purification kit (Promega Corp.) and the sequences 
determined. A single example (designated pBGC802, 

25 Figure 1) was chosen and used as a template for 
specific amplification of the coding region. 

Amplification of the squalene synthetase 
coding region was carried out as follows: 
Plasmid pBGC802 (10 ng) was used as a template in 

30 amplification by PCR using primers SSP-9 {5'-GCG CAT 

ATG GGG AGT TTG AGG GCT ATT CT-3') and SSP-10 (5'- CGA 
GTC GAC TAA GAT CGG TTT CCG GAT AG -3') . Primer SSP-9 
was homologous to squalene synthetase gene sequences 
containing the translational initiation codon. 

35 Additionally, several nucleotides were changed to 
introduce an Nde 1 restriction site in the primer. 
Primer SSP-10 was complementary to squalene synthetase 
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gene sequences containing the translatibnal termination 
codon with several nucleotides changed to create a Sal 
1 restriction site dovmstream from the stop codon. 

The amplification reactions consisted of 80.5 
5 111 of water, 10 fil of lOX Tag DNA polymerase buffer 
(Gibco BRL) , 5 ^1 of MgCla 50 mM, 1 fil (10 pmole) of 
each primer SSP-9 and SSP-10, 1 ^1 of the DNA 
Polymerization Mix, 1 ^1 of the uncut plasmid DNA 
(pBSG802 at 10 ng/fil) and 0.5 /xl (2.5 units) of Tag DNA 

10 Polymerase (Gibco BRL) . The amplifications were 

performed with 4 min incubation at at 94**C; 2 cycles of 
40 sec at 94**C, 20 sec at 45^C, 1 min at 72«*C; 20 
cycles of 40 sec at 94®C, 20 sec at 55**C, 1 min at 
72**C; and 5 min incubation at 72®C. 

15 The amplified products were analyzed by 

agarose gel electrophoresis and the single DNA fragment 
of 1240 bp was purified using Geneclean®. The PGR 
products were ligated to the pCR II vector using the 
directions of the manufacturer. The ligation products 

20 were used to transform E. coli competent cells 

:.v^'->A (OneShot^ INVaF' competent cells, Invitrogn Corp. r 

Analysis of plasmid DNA from the resulting bacterial 
colonies revealed an insert of approximately 1245 bp. 
Plasmid DNA was purified using the Wizard midiprep 

25 purification kit and the DNA sequence determined. The 
resulting clone, designated pBGC803 (Figure 2), thus 
consisted primarily of the Nicotiana benthamiana 
sgualene synthetase coding region . " 

Cloning of the sgualene synthetase gene in a 

30 bacterial expression vector was carried out as follows: 

Plasmid pBGC803 was cut with Sal I and with 
Wde I in order to excise the sgualene synthetase coding 
region. The digestion products were analyzed by 
agarose gel electrophoresis and the 1240 bp DNA 

35 fragment purified using Geneclean"*. Plasmid pT7-7 

(Tabor et al . , Proc. Nat. Acad. Sci , USA. Vol. 82, p- 
1074 (1985)} was cut with Nde I and Sal I and the 
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squalene synthetase coding region was inserted followed 
by transformation of DHIOB competent cells (Gibco BRL) . 
One of the resulting clones, designated pBGC804 
(Figure 3) , contains the -Wde 1/Sal I DNA fragment of 
5 clone pBGC803. 

Competent cells of bacterial strain BIi21 
(DE3) (Novagen, Inc., Madison, WI) were transformed 
with plasmid pBGC804 . A culture was started from a 
well isolated colony in media containing 50 ptg/mL of 

10 ampicillin. When the OD^qq was approximately equal to 

0.4, IPTG was added to a final concentration of 0.4 mM. 
The culture was maintained at 37®C with agitation. At 
0, 10 min, 30 min, 60 min, 90 min, 120 min and 240 min 
post -induction with IPTG, aliquots of 1.5 mL were taken 

15 and the cells pelleted at 14,000 rpm for 3 min. The 
cell pellet was then resuspended with 100 fil of IX 
loading buffer and 10 til of each aliquot was analyzed 
for total protein content using a 12% SDS- 
polyacrylamide precast gel (Novex, San Diego, CA) . A 

20 protein of approximately 45 kDa increased in intensity 

from time 0 to time 24 0 min post -induction with IPTG. 
, The calculated molecular weight for the squalene 
synthetase protein is 47,073. 
Example 2 

25 Insertion of a 700 bp antisense Nicotiana 

benthamiana squalene synthetase sequence into a virus- 
based gene expression system and expression in 
transfected plants was accomplished as follows: 

A. Insertion of the squalene synthetase gene 

30 into an RNA viral vector: RNA viral vector TTOIA was 
constructed as follows: 

An 861 bp fragment from tomato mosaic virus 
virion RNA (fruit necrotic strain F; ToMV-F) was 
isolated by RT-PCR using ToMV primers 5'-CTC GCA AAG 

35 TTT GGA ACC AAA TCC TC-3' (upstream) and 5'-CGG GGT ACC 

TGG GCC CCA ACC GGG GGT TCC GGG GG-3' (downstream) and 
subcloned into the Hindi site of pBluescript KS- 
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(Stratagene, LaJolla, CA) . A hybrid virus consisting 
of TMV-Ul and ToMV-F was constructed by swapping an 874 
bp BajnHJ/KpjiJ ToMV fragment into pBGC152. See, Kumagai 
et al . . Proc. Nat. Acad. Sci . USA . Vol. 89, p. 427 
5 (1993), creating plasmid pTTOl . A unique Avrll site 

was inserted downstream of the Xhol site in TTOl by PGR 
mutagenesis, creating plasmid TTOIA, using the primers 
5'-TCC TCG AGC TAG GCT CGC AAA GTT TCG AAC CAA ATC CTC 
A-3' (upstream) and 5'-CGG GGT ACT GGG CCC CAA CCG GGG 
10 GTT CCG GGG G-3' (downstream) . 

The squalene synthetase 3' RACE product was 
cloned into plasmid pCRII. Unique Xhol and Avrll sites 
were inserted into a 642 bp antisense fragment by PGR 
mutagenesis. A 50 ^1 reaction was assembled with a 
15 final composition of 20 mM Tris-HCl (pH 8.4), 50 mM 

KCl, 2.5 mM MgClj, 0.01% (w/v) gelatin, 0.5 mM each of 
dATP, dCTP, dGTP,and dTTP, 200 nM primer SSP-11 (5'- 
GCG GCT CGA GGA CTC TAG AAC AC-3'), 200 nM primer SSP- 
12 (5'- CGG GCT AGG AGA TGT CTG GTT TGC-3'), and 5 
20 units AmpliTaq® DNA polymerase. The amplification was 
performed -with one cycle of 2 min at 94"'C, 30 cycles of 
40 sec at 94"»C, 30 sec at 55«»C, 1 min at 72*»G, and one 
cycle at 72«'G for 5 min. The amplified products were 
analyzed by separation on a 1% SeaKem® agarose gel in 
25 TAE by gel electrophoresis and a band corresponding to 
the predicted size was excised and the DNA purified 
using Geneclean"*. The isolated antisense scpialene 
synthetase fragment was cloned into the Xhol and Avrll 
sites of TTOIA to create plasmid pBGC805 (Figure 4). 
30 Plasmid pBGC805, was linearized with Kpnl and 

transcripts were synthesized in vitro using a 
MEGAscript SP6 Kit and m7G(5' )ppp(5' )G (Ambion, Austin. 
TX) . N. icotiana benthamiana plants were inoculated 
with in vitro transcripts and maintained in a growth 
35 chamber at 25"C and a 12 hr light/dark regime for 13 
days. At 6 and 13 days post -inoculation, infected 
leaves were harvested and analyzed for squalene 
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synthetase activity. Isolation of total plant 
membranes and assay for squalene synthetase enzyme 
activity were performed as previously described. See 
Hanley et_al. , Plant Phvsiol ■ , Vol. 98, p. 215 (1992). 
5 By 13 days post -inoculation squalene synthetase 
activity in the plants transfected with the anti- 
squalene synthetase construct was undetectable. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Hanley, Kathleen M. 

Nicolas , Olivier 
Hellmaxin, Gary M. 

(ii) TITLE OF INVENTION: DNA Sequence encoding squalene 
synthetase 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: R. J. Reynolds Tobacco Company 

(B) STREET: 950 Reynolds Blvd., P. O. Box 1487 

(C) CITY: Winston-Salem 

(D) STATE: North Carolina 

(E) COUNTRY: U, S. A. 

(F) ZIP: 27102-1487 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Borschke, August J. 

(B) REGISTRATION NUMBER: 30,539 

(C) REFERENCE /DOCKET.^N^^ CC-212A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 910-741-5491 

(B) TELEFAX: 910-741-5449 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CKA3RACTERISTICS : 

(A) LENGTH: 1642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Nicotiana benthamiana 
(F) TISSUE TYPE: Leaf 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 112.. 1344 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATCATCGTTT CCACGCAATT CATCGTTCAA CGACTTCTTC AAATACAACC ATTTTGCCAA 60 

CGCTTTCTCC TCTTACCAAA CAAACAACCC CCCACCAAAG GCTGAATAAG A ATG GGG 117 

Met Gly 
1 

AGT TTG AGG GCT ATT CTG AAG AAT CCA GAG GAT TTA TAT CCA TTG GTG 165 
Ser Leu Arg Ala lie Leu Lys Asn Pro Glu Asp Leu Tyr Pro Leu Val 
5 10 15 

AAG CTG AAG CTA GCG GCT CGA CAC GCG GAG AAG CAG ATC CCG CCG TCT 213 
Lys Leu Lys Leu Ala Ala Arg His Ala Glu Lys Gin lie Pro Pro Ser 
20 25 30 

CCA AAT TGG GGC TTC TGT TAC TCA ATG CTT CAT AAG GTT TCT CGT AGC 261 
Pro Asn Trp Gly Phe Cys Tyr Ser Met Leu His Lys Val Ser Arg Ser 
35 40 45 50 

TTT GCT CTC GTC ATT CAA CAA CTT CCA GTC GAG CTT CGT GAC GCC GTG 309 
Phe Ala Leu Val lie Gin Gin Leu Pro Val Glu Leu Arg Asp Ala Val 
55 60 65 

TGC ATT TTC TAT TTG GTT CTT CGA GCA CTT GAC ACT GTT GAG GAT GAT 357 
Cys lie Phe Tyr Leu Val Leu Arg Ala Leu Asp Thr Val Glu Asp Asp 
70 75 80 

ACC AGC ATT CCC ACC GAT GTT AAA GTT CCT ATT CTG ATC TCT TTT CAT 405 
Thr Ser lie Pro Thr Asp Val Lys Val Pro lie Leu lie Ser Phe His 
65 90 95 

CAG CAT GTT TAT GAT CGC GAA TGG CAT TTT TCA TGT GGT ACA AAG GAG 453 
Gin His Val Tyr Asp Arg Glu Trp His Phe Ser Cys Gly Thr Lys Glu 
100 105 110 

TAC AAG GTT CTC ATG GAC CAG TTC CAT CAT GTA TCA ACT GCT TTT CTG 501 
Tyr Lys Val Leu Met Asp Gin Phe His His Val Ser Thr Ala Phe Leu 
115 120 125 130 

GAG CTT AGG AAA CAT TAT CAG CAG GCA ATT GAG GAT ATT ACC ATG AGG 549 
Glu Leu Arg Lys His Tyr Gin Gin Ala lie Glu Asp lie Thr Met Arg 
135 140 145 

ATG GGT GCA GGA ATG GCA AAA TTC ATA TGC AAG GAG GTG GAA ACA ACC 597 
Met Gly Ala Gly Met Ala Lys Phe lie Cys Lys Glu Val Glu Thr Thr 
150 155 160 
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GAT GAT TAT GAC GAA TAT TGT CAC TAT GTA GCT GGG CTT GTT GGG CTA 645 
Asp Asp Tyr Asp Glu Tyr Cys His Tyr Val Ala Gly Leu Val Gly Leu 
165 170 175 

GGA TTG TCA AAA CTG TTC CAT GCC TCT GAG AAA GAA GAT CTG GCT TCA 693 
Gly Leu Ser Lys Leu Phe His Ala Ser Glu Lys Glu Asp Leu Ala Ser 
180 185 190 

GAT TCT CTC TCC AAC TCC ATG GGT TTA TTT CTT CAG AAA ACA AAC ATC 741 
Asp Ser Leu Ser Asn Ser Met Gly Leu Phe Leu Gin Lys Thr Asn lie 
195 200 205 210 

ATT AGA GAT TAT TTG GAA GAC ATA AAT GAA GTA CCC AAG TGC CGT ATG 789 
lie Arg Asp Tyr Leu Glu Asp lie Asn Glu Val Pro Lys Cys Arg Met 
215 220 225 

TTC TGG CCC CGT GAA ATA TGG AGT AAA TAT GTT AAC AAG CTT GAG GAA 837 
Phe Trp Pro Arg Glu lie Trp Ser Lys Tyr Val Asn Lys Leu Glu Glu 
230 235 240 

TTA AAG TAC GAG GAT AAC TCG GCC AAA GCA GTG CAA TGT CTA AAT GAC 885 
Leu Lys Tyr Glu Asp Asn Ser Ala Lys Ala Val Gin Cys Leu Asn Asp 
245 250 255 

ATG GTC ACT AAT GCT TTA TCA CAT GTA GAA GAT TGT TTG ACT TAC ATG 933 
Met Val Thr Asn Ala Leu Ser His Val Glu Asp Cys Leu Thr Tyr Met 
260 265 270 

TCT GCT TTG CGT GAT CCT TCC ATC TTT CGA TTC TGT GCT ATT CCA CAG 981 
Ser Ala Leu Arg Asp Pro Ser lie Phe Arg Phe Cys Ala lie Pro Gin 
275 280 285 290 

GTC ATG GCA ATT GGG ACA TTA GCT ATG TGC TAC GAC AAC ATT GAA GTC 1029 
Val Met Ala He Gly Thr Leu Ala Met Cys Tyr Asp Asn He Glu Val 
295 300 305 

_ TTq AGA GGA GTG GTA AAA ATG AGA CGT GGT CTG ACT GCT AAG GTC ATT 1Q77 
- Phe Arg Gly Val Val Lys Met Arg Arg Gly Leu Thr Ala Lys Val lie ' 
310 315 320 

GAC CGG ACC AGG ACT ATT GCA GAT GTA TAT GGT GCT TTT TTT GAC TTT 1125 
Asp Arg Thr Arg Thr He Ala Asp Val Tyr Gly Ala Phe Phe Asp Phe 
325 330 335 

TCT TGT ATG CTG AAA TCC AAG GTT AAT AAT AAT GAT CCA AAT GCA ACA 1X73 
Ser Cys ' Met Leu Lys Ser Lys Val Asn Asn Asn Asp Pro Asn Ala Thr . 
340 345 350 

AAA ACT CTG AAG AGG CTC GAA GTG ATC CTG AAA ACT TGC AGA GAT TCG 1221 
Lys Thr Leu Lys Arg Leu Glu Val He Leu Lys Thr Cys Arg Asp Ser 
355 360 365 370 

GGA ACC TTG AAC AAA AGG AAA TCC TAC ATA ATC AGG AGC GAG CCT AAT 1269 
Gly Thr Leu Asn Lys Arg Lys Ser Tyr He He Arg Ser Glu Pro Asn 
375 380 385 
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TAG AGT CCA GTT CTG ATT GTT GTC ATT TTC ATC ATA CTG GOT ATT ATT 1317 
Tyr Ser Pro Val Leu lie Val Val He Phe He He Leu Ala lie He 
390 395 400 

CTC GCA GAG CTA TCC GGA AAC CGA TCT TAGACGATAT TTTGGGTTAC 1364 
Leu Ala Gin Leu Ser Gly Asn Arg Ser 
405 410 

AAAAAAGAAG TCTGGTCAAG GAAGACAGCA GAAGCTCTTG GCCAATTGTG TGATTAGTGC 1424 

AGATTTTGAT GTTTGTAATT CTATCGTCCA TTAAGTGATA GTTGCACCTT TTAACCTGAC 1484 

AAGATAATTA CGAAGACCTA TTTTTGGTGG TTTGTTGGGT ATGTACTTGT TTGCAAGGCT 1544 

ACGTAAGCAA ATTCCAAGTG TTGTAGAGTC ACCGTGATGT AATAAACATG TCTTTTATTA 1604 

TAGTTTGTTC ATTTTTGTGG TAAAAAAAAA AAAAAAAA 1642 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Gly Ser Leu Arg Ala He Leu Lys Asn Pro Glu Asp 
15 10 

Leu Val Lys Leu Lys Leu Ala Ala Arg His Ala Glu Lys 
20 25 

Pro Ser Pro Asn Trp Gly Phe Cys Tyr Ser Met lieu His 
35 40 45 

Arg Ser Phe Ala Leu Val He Gin Gin Leu Pro Val Glu 
50 55 60 

Ala Val Cys He Phe Tyr Leu Val Leu Arg Ala Leu Asp 
65 70 75 

Asp Asp Thr Ser He Pro Thr Asp Val Lys Val Pro He 
85 90 

Phe His Gin His Val Tyr Asp Arg Glu Trp His Phe Ser 
100 105 

Lys Glu Tyr Lys Val Leu Met Asp Gin Phe His His Val 
115 120 125 

Phe Leu Glu Leu Arg Lys His Tyr Gin Gin Ala He Glu 
130 135 140 



Leu Tyr Pro 
15 

Gin He Pro 
30 

Lys Val Ser 

Leu Arg Asp 

Thr Val Glu 
60 

Leu He Ser 
95 

Cys Gly Thr 
110 

Ser Thr Ala 
Asp He Thr 
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Met Arg Met Gly Ala Qly Met Ala Lys Phe lie Cys Lys Glu Val Glu 
145 150 155 160 

Thr Thr Asp Asp Tyr Asp Glu Tyr Cys His Tyr Val Ala Gly Leu Val 
165 170 175 

Gly lieu Gly Leu Ser Lys Leu Phe His Ala Ser Glu Lys Glu Asp Leu 
180 185 190 

Ala Ser Asp Ser Leu Ser Asn Ser Met Gly Leu Phe Leu Gin Lys Thr 
195 200 205 

Asn lie lie Arg Asp Tyr Ijeu Glu Asp He Asn Glu Val Pro Lys Cys 
210 215 220 

Arg Met Phe Trp Pro Arg Glu He Trp Ser Lys Tyr Val Asn Lys Leu 
225 230 235 240 

Glu Glu Leu Lys Tyr Glu Asp Asn Ser Ala Lys Ala Val Gin Cys Leu 
245 250 255 

Asn Asp Met Val Thr Asn Ala Leu Ser His Val Glu Asp Cys Leu Thr 
260 265 270 

Tyr Met Ser Ala Leu Arg Asp Pro Ser He Phe Arg Phe Cys Ala He 
275 280 285 

Pro Gin Val Met Ala He Gly Thr Leu Ala Met Cys Tyr Asp Asn He 
290 295 300 

- Glu Val Phe Arg Gly Val Val Lys Met Arg Arg Gly Leu Thr Ala Lys 
305 310 315 320 

Val He Asp Arg Thr Arg Thr He Ala Asp Val Tyr Gly Ala Phe Phe 
325 330 335 

ASP Phe Ser Cys Met Leu Lys Ser Lys Val Asn Asn Asn Asp Pro Asn 
340 345 - 350 . 

Ala Thr Lys Thr Leu Lys Arg Leu Glu Val He Leu Lys Thr Cys Arg 
355 360 365 

Asp Ser Gly Thr Leu Asn Lys Arg Lys Ser Tyr He He Arg Ser Glu 
370 375 380 

Pro Asn Tyr Ser Pro Val Leu He Val Val He Phe He He Leu Ala 
385 390 395 400 

He He Leu Ala Gin Leu Ser Gly Asn Arg Ser 
405 410 
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THAT WHICH IS CIiAIMED IS : 

1. An isolated nucleotide molecule encoding a 
polypeptide which has enzymatic activity for producing 
sgualene, which nucleotide molecule is isolated from a 
Nicotians, species* 

5 

2 . An isolated nucleotide molecule encoding a 
polypeptide which has enzymatic activity for producing 
squalene, which nucleotide molecule encodes a 
polypeptide having an amino acid sequence specified by 

10 SEQ ID NO:2- 

3. An isolated nucleotide molecule encoding 
sgualene synthetase selected from the group consisting 
of: 

15 (a) isolated nucleotide molecules consisting 

essentially of the sequence specified by SEQ ID NO:l; 

(b) isolated nucleotide molecules which 
hybridize to an isolated nucleotide molecule of (a) 
above under conditions represented by a wash stringency 

20 of 0.3M NaCl, 0 . 03M sodium citrate, and 0,1% SDS at 
60^ C, which are at least about 95% homologous to 
isolated nucleotide molecules of (a) above and which 
encode squalene synthetase; and 

(c) isolated nucleotide molecules which 
25 differ in sequence from the isolated nucleotide 

molecules of (a) and (b) above due to the degeneracy of 
the genetic code, and which encode squalene synthetase. 

4 . The nucleotide molecule of Claim 1 in the 
30 form of DNA. 

5. The nucleotide molecule of Claim 2 in the 
form of DNA. 



i- 
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6. The nucleotide molecule of Claim 3 in the 
form of DNA. 

7. The nucleotide molecule of Claim 5 
5 isolated from a Nicotiana species. 

8. The nucleotide molecule of Claim 6 
isolated from a Nicotiana species. 

10 9. The nucleotide molecule of Claim 4 

isolated from Nicotiana benthaxnisLna . 

10. The nucleotide molecule of Claim 5 
isolated from Nicotiana benthamiana . 



15 



11. The nucleotide molecule of Claim 6 
isolated from Nicotiana benthamiana . 



.9e09393AlJ_> 
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